Limitations and Alternatives for the Evaluation of Large-scale Link Prediction
نویسندگان
چکیده
Link prediction, the problem of identifying missing links among a set of inter-related data entities, is a popular field of research due to its application to graph-like domains. Producing consistent evaluations of the performance of the many link prediction algorithms being proposed can be challenging due to variable graph properties, such as size and density. In this paper we first discuss traditional data mining solutions which are applicable to link prediction evaluation, arguing about their capacity for producing faithful and useful evaluations. We also introduce an innovative modification to a traditional evaluation methodology with the goal of adapting it to the problem of evaluating link prediction algorithms when applied to large graphs, by tackling the problem of class imbalance. We empirically evaluate the proposed methodology and, building on these findings, make a case for its importance on the evaluation of large-scale graph processing. Graph Mining, Link Prediction, Evaluation Methodology
منابع مشابه
Link Prediction using Network Embedding based on Global Similarity
Background: The link prediction issue is one of the most widely used problems in complex network analysis. Link prediction requires knowing the background of previous link connections and combining them with available information. The link prediction local approaches with node structure objectives are fast in case of speed but are not accurate enough. On the other hand, the global link predicti...
متن کاملSecondary Use of Laboratory data: Potentialities and Limitations
Clinical databases have been developed in recent years especially during the course of all medical concerns including laboratory results. The information produced by the diagnostic laboratories have great impact on health care system with various secondary uses. These uses are sometimes as publishing new extracted information of laboratory reports which have been widely applied in the scientifi...
متن کاملA Link Prediction Method Based on Learning Automata in Social Networks
Nowadays, online social networks are considered as one of the most important emerging phenomena of human societies. In these networks, prediction of link by relying on the knowledge existing of the interaction between network actors provides an estimation of the probability of creation of a new relationship in future. A wide range of applications can be found for link prediction such as electro...
متن کاملEvaluation and Prioritization of Criteria Affecting the Selection of Landscape Species, Using Multi-Criteria Decision-Making Systems
It is impractical to implement conservation efforts for all species due to complexity of natural systems, large scale of biodiversity issues, and budget limitations. Prioritizing species of conservation importance can alleviate this issue. Multiple interrelated criteria may be used for conservation prioritization of species. Therefore, the accurate evaluation of criteria is a multi-criteria dec...
متن کاملAn Evaluation of Large-Scale Commercial Buildings Architectural Space Indicators with an Approach to Urban Threats and Risks
Abstract: Achieving to technical criteria that by applying them in architectural design stage of large commercial buildings, considered as one of the most important uses in urban planning, can increase the level of citizens’ security against all kinds of natural hazards and threats, and reduce vulnerability and it can provide continuation of the services and activities of these buildings. The p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1611.00547 شماره
صفحات -
تاریخ انتشار 2016